Search CORE

153 research outputs found

Early Prediction of Movie Box Office Success based on Wikipedia Activity Big Data

Author: A Halavais
A Ishii
A Spoerri
A Spoerri
Attila Szolnoki
B Suh
C Castillo
CA Hidalgo
G Eysenbach
HS Moat
J Bollen
J Ginsberg
J Ratkiewicz
J Török
János Kertész
Márton Mestyán
R Kimmons
R Sharda
RK Pan
S Saavedra
S Sinha
S Sreenivasan
T Brody
T Holloway
T Preis
T Preis
T Yasseri
T Yasseri
T Yasseri
T Yasseri
Taha Yasseri
X Shuai
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Use of socially generated "big data" to access information about collective states of the minds in human societies has become a new paradigm in the emerging field of computational social science. A natural application of this would be the prediction of the society's reaction to a new product in the sense of popularity and adoption rate. However, bridging the gap between "real time monitoring" and "early predicting" remains a big challenge. Here we report on an endeavor to build a minimalistic predictive model for the financial success of movies based on collective activity data of online users. We show that the popularity of a movie can be predicted much before its release by measuring and analyzing the activity level of editors and viewers of the corresponding entry to the movie in Wikipedia, the well-known online encyclopedia.Comment: 13 pages, Including Supporting Information, 7 Figures, Download the dataset from: http://wwm.phy.bme.hu/SupplementaryDataS1.zi

arXiv.org e-Print Archive

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Aaltodoc Publication Archive

Oxford University Research Archive

FigShare

Emergence of world-stock-market network

Author: Bayani H.
Jafari G. R.
Jamali T.
Kamali M. Z.
Saeedian M.
Yasseri T.
Publication venue
Publication date: 01/01/2017
Field of study

In the age of globalization, it is natural that the stock market of each country is not independent form the other markets. In this case, collective behavior could be emerged form their dependency together. This article studies the collective behavior of a set of forty influential markets in the world economy with the aim of exploring a global financial structure that could be called world-stock-market network. Towards this end, we analyze the cross-correlation matrix of the indices of these forty markets using Random Matrix Theory (RMT). We find the degree of collective behavior among the markets and the share of each market in their structural formation. This finding together with the results obtained from the same calculation on four stock markets reinforce the idea of a world financial market. Finally, we draw the dendrogram of the cross-correlation matrix to make communities in this abstract global market visible. The dendrogram, drawn by at least thirty percent of correlation, shows that the world financial market comprises three communities each of which includes stock markets with geographical proximity

arXiv.org e-Print Archive

Oxford University Research Archive

Mapping the UK Webspace: Fifteen Years of British Universities on the Web

Author: Cowls Josh
Hale Scott A.
Margetts Helen
Meyer Eric T.
Schroeder Ralph
Yasseri Taha
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2014
Field of study

This paper maps the national UK web presence on the basis of an analysis of the .uk domain from 1996 to 2010. It reviews previous attempts to use web archives to understand national web domains and describes the dataset. Next, it presents an analysis of the .uk domain, including the overall number of links in the archive and changes in the link density of different second-level domains over time. We then explore changes over time within a particular second-level domain, the academic subdomain .ac.uk, and compare linking practices with variables, including institutional affiliation, league table ranking, and geographic location. We do not detect institutional affiliation affecting linking practices and find only partial evidence of league table ranking affecting network centrality, but find a clear inverse relationship between the density of links and the geographical distance between universities. This echoes prior findings regarding offline academic activity, which allows us to argue that real-world factors like geography continue to shape academic relationships even in the Internet age. We conclude with directions for future uses of web archive resources in this emerging area of research.Comment: To appear in the proceeding of WebSci 201

arXiv.org e-Print Archive

Crossref

Oxford University Research Archive

Editorial: At the Crossroads: Lessons and Challenges in Computational Social Science

Author: Borge-Holthoefer J.
Moreno Y.
Yasseri T.
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2016
Field of study

The interest of physicists in economic and social questions is not new: during the last decades, we have witnessed the emergence of what is formally called nowadays sociophysics [1] and econophysics [2] that can be grouped into the common term “Interdisciplinary Physics” along with biophysics, medical physics, agrophysics, etc. With tools borrowed from statistical physics and complexity science, among others, these areas of study have already made important contributions to our understanding of how humans organize and interact in our modern society. Large scale data analyses, agent-based modeling and numerical simulations, and finally mathematical modeling, have led to the discovery of new (universal) patterns and their quantitative description in socio-economic systems..

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Repositorio Universidad de Zaragoza

Frontiers - Publisher Connector

Oxford University Research Archive

The Oberta in open access

Directory of Open Access Books (DOAB)

The Digital Flynn Effect: Complexity of Posts on Social Media Increases over Time

Author: C Wood
I Smirnov
JR Flynn
JR Flynn
JR Flynn
JW Pennebaker
MA Drouin
N Karpov
R Gunning
RF Flesch
T Yasseri
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Parents and teachers often express concern about the extensive use of social media by youngsters. Some of them see emoticons, undecipherable initialisms and loose grammar typical for social media as evidence of language degradation. In this paper, we use a simple measure of text complexity to investigate how the complexity of public posts on a popular social networking site changes over time. We analyze a unique dataset that contains texts posted by 942, 336 users from a large European city across nine years. We show that the chosen complexity measure is correlated with the academic performance of users: users from high-performing schools produce more complex texts than users from low-performing schools. We also find that complexity of posts increases with age. Finally, we demonstrate that overall language complexity of posts on the social networking site is constantly increasing. We call this phenomenon the digital Flynn effect. Our results may suggest that the worries about language degradation are not warranted

arXiv.org e-Print Archive

Crossref

MAnnheim DOCument Server

Circadian patterns of Wikipedia editorial activity: A demographic analysis

Author: A Pozdnoukhov
AG West
AL Barabási
Attila Szolnoki
BA Huberman
BA Huberman
C Jonathan
DH Spennemann
DHR Spennemann
DM Wilkinson
F Ortega
F Wu
G Szabo
HH Jo
J Ratkiewicz
J Voss
János Kertész
M Karsai
R Sumi
R Sumi
Robert Sumi
S Javanmardi
S Panda
T Holloway
T Yasseri
Taha Yasseri
TK Park
Y Wu
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2011
Field of study

Wikipedia (WP) as a collaborative, dynamical system of humans is an appropriate subject of social studies. Each single action of the members of this society, i.e. editors, is well recorded and accessible. Using the cumulative data of 34 Wikipedias in different languages, we try to characterize and find the universalities and differences in temporal activity patterns of editors. Based on this data, we estimate the geographical distribution of editors for each WP in the globe. Furthermore we also clarify the differences among different groups of WPs, which originate in the variance of cultural and social features of the communities of editors

arXiv.org e-Print Archive

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Aaltodoc Publication Archive

Dynamics of conflicts in Wikipedia

Author: A Capocci
A Halavais
A Kittur
A Kittur
A Vázquez
AK Laird
AL Barabási
András Kornai
András Rung
Attila Szolnoki
B Adler
B Suh
BQ Vuong
D Laniado
D Laniado
DG Champernowne
DM Wilkinson
DW McDonald
F Ortega
F Tyers
FB Viegas
H Zha
J Giles
J Leskovec
J Ratkiewicz
J Ratkiewicz
J Ratkiewicz
J Schneider
J Voss
János Kertész
K Samson
K Smets
KI Goh
L Buriol
M Hu
M Karsai
M Potthast
M Strube
O Medelyan
P Massa
R Kimmons
R Sumi
R Sumi
RL Rivest
Robert Sumi
S Javanmardi
S Javanmardi
S Vajna
SKS Sharoff
SP Ponzetto
T Gowers
T Yasseri
T Yasseri
T Yasseri
Taha Yasseri
U Brandes
U Brandes
V Zlatić
V Zlatić
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2012
Field of study

In this work we study the dynamical features of editorial wars in Wikipedia (WP). Based on our previously established algorithm, we build up samples of controversial and peaceful articles and analyze the temporal characteristics of the activity in these samples. On short time scales, we show that there is a clear correspondence between conflict and burstiness of activity patterns, and that memory effects play an important role in controversies. On long time scales, we identify three distinct developmental patterns for the overall behavior of the articles. We are able to distinguish cases eventually leading to consensus from those cases where a compromise is far from achievable. Finally, we analyze discussion networks and conclude that edit wars are mainly fought by few editors only.Comment: Supporting information adde

arXiv.org e-Print Archive

CiteSeerX

Public Library of Science (PLOS)

Crossref

SZTAKI Publication Repository

Directory of Open Access Journals

PubMed Central

Oxford University Research Archive

FigShare

A practical approach to language complexity: a wikipedia case study

Author: A Halavais
A Kornai
A Mikheev
András Kornai
D van Leijenhorst
D Varga
E Gabrilovich
Eduardo G. Altmann
EG Altmann
EG Altmann
F Tweedie
GR Klare
JC Roberts
János Kertész
M Serrano
MD Besten
MK Paasche-Orlow
O Medelyan
R Baeza Yates
R Gunning
R Lambiotte
S Javanmardi
T Yasseri
T Yasseri
Taha Yasseri
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2012
Field of study

In this paper we present statistical analysis of English texts from Wikipedia. We try to address the issue of language complexity empirically by comparing the simple English Wikipedia (Simple) to comparable samples of the main English Wikipedia (Main). Simple is supposed to use a more simplified language with a limited vocabulary, and editors are explicitly requested to follow this guideline, yet in practice the vocabulary richness of both samples are at the same level. Detailed analysis of longer units (n-grams of words and part of speech tags) shows that the language of Simple is less complex than that of Main primarily due to the use of shorter sentences, as opposed to drastically simplified syntax or vocabulary. Comparing the two language varieties by the Gunning readability index supports this conclusion. We also report on the topical dependence of language complexity, that is, that the language is more advanced in conceptual articles compared to person-based (biographical) and object-based articles. Finally, we investigate the relation between conflict and language complexity by analyzing the content of the talk pages associated to controversial and peacefully developing articles, concluding that controversy has the effect of reducing language complexity

arXiv.org e-Print Archive

Crossref

SZTAKI Publication Repository

Directory of Open Access Journals

PubMed Central

FigShare

Human-machine networks: Towards a typology and profiling framework

Author: Bravos G.
Eide A.W.
Engen Vegard
Følstad A.
Lüders M.
Meyer E.T.
Pickering J.B.
Tsvetkova M.
Walland P.
Yasseri T.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

© Springer International Publishing Switzerland 2016. In this paper we outline an initial typology and framework for the purpose of profiling human-machine networks, that is, collective structures where humans and machines interact to produce synergistic effects. Profiling a humanmachine network along the dimensions of the typology is intended to facilitate access to relevant design knowledge and experience. In this way the profiling of an envisioned or existing human-machine network will both facilitate relevant design discussions and, more importantly, serve to identify the network type. We present experiences and results from two case trials: a crisis management system and a peerto- peer reselling network. Based on the lessons learnt from the case trials we suggest potential benefits and challenges, and point out needed future work

arXiv.org e-Print Archive

Southampton (e-Prints Soton)

Crossref

LSE Research Online

ZENODO

Oxford University Research Archive

Bournemouth University Research Online